A Metric for Music Notation Transcription Accuracy
Abstract
Automatic music transcription aims to transcribe musical performances into music notation. However, most existing transcription systems focus only on parametric transcription, i.e., they output a symbolic representation in absolute terms, showing frequency and absolute time (e.g., a piano-roll representation), but not in musical terms, with spelling distinctions (e.g., A♭ versus G♯) and quantized meter. Recent attempts at producing full music notation output have been hindered by the lack of an objective metric for measuring how closely the results adhere to the ground-truth music score, and have had to rely on time-consuming human evaluation by music theorists. In this paper, we propose an edit distance similar to the Levenshtein distance, which measures the difference between two sequences, typically strings of characters. The metric treats a music score as a sequence of sets of musical objects, ordered by their onsets. It reports the differences between two music scores in terms of twelve aspects: barlines, clefs, key signatures, time signatures, notes, note spelling, note durations, stem directions, groupings, rests, rest duration, and staff assignment. We also apply a linear regression model to the metric in order to predict human evaluations on a dataset of short music excerpts automatically transcribed into music notation.
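The idea of a Levenshtein-style distance over a sequence of sets can be illustrated with a minimal sketch. This is not the metric proposed in the paper: the function name `set_sequence_edit_distance`, the string labels used for musical objects, and the cost scheme (inserting or deleting a set costs its size, substituting one set for another costs the size of their symmetric difference) are all assumptions made for illustration only.

```python
from typing import FrozenSet, List, Sequence

# Hypothetical representation: each onset holds a set of object labels.
ObjectSet = FrozenSet[str]


def set_sequence_edit_distance(a: Sequence[ObjectSet], b: Sequence[ObjectSet]) -> int:
    """Levenshtein-style distance between two sequences of sets.

    Assumed costs (illustrative, not taken from the paper):
      - deleting or inserting a set costs the number of objects in it;
      - substituting one set for another costs the number of objects
        that appear in exactly one of the two sets.
    """
    rows, cols = len(a) + 1, len(b) + 1
    dist: List[List[int]] = [[0] * cols for _ in range(rows)]

    # First column: delete every set of `a` up to position i.
    for i in range(1, rows):
        dist[i][0] = dist[i - 1][0] + len(a[i - 1])
    # First row: insert every set of `b` up to position j.
    for j in range(1, cols):
        dist[0][j] = dist[0][j - 1] + len(b[j - 1])

    for i in range(1, rows):
        for j in range(1, cols):
            dist[i][j] = min(
                dist[i - 1][j] + len(a[i - 1]),                     # delete
                dist[i][j - 1] + len(b[j - 1]),                     # insert
                dist[i - 1][j - 1] + len(a[i - 1] ^ b[j - 1]),      # substitute
            )
    return dist[-1][-1]


# Two onsets each; the second onset differs only in note spelling.
reference = [frozenset({"C4", "E4"}), frozenset({"Ab4"})]
transcribed = [frozenset({"C4", "E4"}), frozenset({"G#4"})]
print(set_sequence_edit_distance(reference, transcribed))  # prints 2
```

In this toy scheme a misspelled note counts as one removed object plus one added object. A metric that reports note spelling as its own aspect, as the abstract describes, would need a finer-grained comparison of the objects inside each set.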
Related resources
Clavision: visual automatic piano music transcription
One important problem in Music Information Retrieval is Automatic Music Transcription, which is an automated conversion process from played music to a symbolic notation such as sheet music. Since the accuracy of previous audio-based transcription systems is not satisfactory, we propose an innovative visual-based automatic music transcription system named claVision to perform piano music transcri...
A Transcription System from MusicXML Format to Braille Music Notation
The Internet enables us to freely access music as recorded sound and even music scores. For the visually impaired, music scores must be transcribed from computer-based musical formats to Braille music notation. This paper proposes a transcription system from the MusicXML format to Braille music notation using a structural model of Braille music notation. The resultant Braille scores inspected b...
Transcribing Human Piano Performances into Music Notation
Automatic music transcription aims to transcribe musical performances into music notation. However, existing transcription systems that have been described in research papers typically focus on multi-F0 estimation from audio and only output notes in absolute terms, showing frequency and absolute time (a piano-roll representation), but not in musical terms, with spelling distinctions (e.g., A♭ v...
Introduction to Music Transcription
Music transcription refers to the analysis of an acoustic musical signal so as to write down the pitch, onset time, duration, and source of each sound that occurs in it. In Western tradition, written music uses note symbols to indicate these parameters in a piece of music. Figures 1–2 show the notation of an example music signal. Omitting the details, the main conventions are that time flows fr...
Creating an XML Vocabulary for Encoding Lute Music
We describe the development of an XML representation, called TabXML, for encoding historical sources of lute music. These sources employ a special notation type, tablature, that is very hard to understand for non-lutenists. This paper discusses several issues in creating TabXML: 1. what to represent: the notational meaning or the text of the tablature, and how to represent it; 2. an analysis of...